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DISCRETE-EVENT-DYNAMIC-SYSTEM-BASED  APPROACHES  FOR  CONTROL 
IN  INTEGRATED  VOICE/DATA  MULTIHOP  RADIO  NETWORKS 


1  Introduction 

This  report  contains  the  results  from  our  work  under  contract  N000014-92-J-2017  for  the  period 
10/1/92  -  9/30/93.  It  describes  the  progress  made  towards  the  effort  to  develop  and  apply  new 
discrete-event-dyncimic-system-based  techniques  for  the  transmission  scheduhng  problem  in  Radio 
Networks  (RN).  The  report  is  in  two  parts:  in  part  one  we  examine  the  transmission  scheduling 
problem  in  the  context  of  (fixed  length)  data  traffic.  In  contrast  to  our  earher  work,  performed 
imder  contract  N000014-91-J-2025  [3,  4],  which  dealt  with  the  broadcast  scheduling  problem  in  a 
fully  connected  packet  RN,  the  first  part  of  this  report  extends  the  proposed  methodology  therein 
to  general  topology  networks.  In  part  two,  we  look  at  the  scheduling  problem  when  processing 
packetized  voice  caUs^  in  a  N-node  multihop  RN.  As  will  be  seen,  the  different  Grade- of- Service 
(GOS)  requirements  associated  with  voice  and  data  traffic  lead  to  two  decidedly  different  scheduling 
policies.  We  begin  with  a  brief  review  of  the  transmission  scheduling  problem  in  A-node  Radio 
Networks. 

In  such  networks  there  sire  N  nodes  that  wish  to  broadcast  over  a  common  channel.  In  order 
for  a  node  to  successfully  transmit,  the  following  primary  interference  constraint  must  be  satisfied: 
if  node  i  transmits,  then  all  nodes  that  are  within  node  i  ’$  broadcast  zone  must  be  in  a  receiving 
mode.  Thus,  all  such  nodes  j  are  viewed  as  “neighbors”  of  i,  connected  to  i  through  links  {i,j). 
In  addition,  there  may  be  a  secondary  interference  constraint,  whereby  if  i  is  tremsmitting  and 
j  is  coimected  to  i,  then  aU  nodes  k  that  are  also  connected  to  j  must  be  in  a  receiving  mode; 
otherwise,  a  {k,j)  transmission  interferes  with  the  (i,j)  transmission.  In  short,  there  are  (generally 
overlapping)  subsets  of  the  N  nodes,  known  as  transmission  sets,  such  that  aU  nodes  in  such  a  set 
can  simultaneously  transmit  without  conflict  (i.e.  without  violating  the  interference  constrmnts). 
Given  a  slotted  time  hne,  the  problem  becomes  the  assignment  of  each  time  slot  to  a  particular 
transmission  set  [10,  9,  18,  20,  23]  so  that  some  desirable  performance  can  be  attained. 

There  have  been  several  approaches  for  solving  the  transmission  scheduling  problem  (see 
references  in  [3]),  however  here,  as  in  [3,  4],  an  alternate  approach  is  proposed  which  uses  on-line 
adaptation  of  the  schedule  (in  a  stochastic  operating  environment),  to  achieve  the  desired  system 
performance.  An  important  distinction  between  the  existing  hterature  and  the  work  reported  here 
and  in  [4]  is  that  no  distributional  information  at  all  about  the  arrival  processes  at  the  various 
network  nodes  is  required. 

As  described  in  [4],  the  transmission  scheduling  problem  can  be  placed  in  the  context  of  a 
“polling”  system  where  the  channel  is  modelled  as  a  single  resource  (server)  which  provides  service 
to  a  set  C  of  transmission  sets.  At  each  network  node  i,  i  =  I, ...,  N,  there  is  a  customer^  arrival 
stream  characterized  by  arbitrary  interarrival  time  distributions  Gi  with  rate  A;.  Let  A  denote  the 
set  of  arrival  streams.  Note  that  we  allow  for  correlation  within  an  individual  queue  arrival  stream, 

*In  this  report  it  is  assumed  a  voice  call  is  comprised  of  an  arbitrary  number  of  fixed-length  packets,  with  constant 
interarrival  times. 

^The  term  “customer”  can  refer  either  to  a  data  packet  or  to  an  entire  voice  caO. 
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as  well  as  between  streams.  A  customer  from  a  particular  arrival  stream  is  allowed  to  join  one  of 

several  prespecified  transmission  sets  associated  with  that  stream.  Therefore,  a  transmission  set 


Cj  is  such  that  Cj  C  .4  and,  in  general,  C,-  fl  Cj  / 1  for  ifj.  The  importance  of  the  distinction 


a  transmission  set,  it  can  serve  multiple  customers  simultaneously,  one  from  every  sirrival  strecim 
that  can  belong  to  this  set.  As  an  illustration,  consider  the  four-node  RN  with  three  transmission 
sets  shown  in  Figure  la.  The  equivalent  polling  representation  is  shown  in  Figure  lb,  where,  a 
customer  from  arrival  stream  2,  for  example,  may  join  either  transmission  set  1  or  2.  In  the  latter 
model,  a  customer  is  held  at  the  head  of  its  arrival  stream  queue  until  the  server  is  assigned  to 
either  transmission  set  1  or  2;  it  is  then  instantaneously  transferred  to  the  appropriate  queue  and 
served  (while  another  customer  from  another  arrival  stream  may  also  receive  service  at  the  same 
time).  We  assume  that  time  is  slotted  (with  slot  size  set  equal  to  the  packet  transmission  time), 
such  that  a  scheduling  decision  is  made  at  the  beginning  of  every  slot.  The  rule  by  which  the  server 
is  assigned  to  a  particular  queue  is  known  as  a  polling  policy. 

The  choice  of  an  appropriate  polhng  policy  is  governed  in  part  by  the  traffic  GOS  require¬ 
ments.  Data  traffic  is  characterized  by  a  tolerance  in  the  end-to-end  packet  delays,  but  no  tolerance 
in  packet  loss.  Therefore,  in  packet  RN  we  assume  infinite  buffers  at  each  network  node  and  such 
that  packets  can  be  queued  with  no  additional  penalties.  Our  objective  is  then  to  select  a  polling 
policy  so  as  to  minimize  the  mean  packet  waiting  time.  In  contrast,  packetized  voice  traffic  can 
tolerate  some  loss  (without  a  severe  deterioration  in  the  voice  playback),  however  it  is  sensitive  to 
end-to-end  delays.  In  addition,  it  is  desirable  that  packets  reach  the  destination  in  the  order  in 
which  they  are  transmitted.  Assiime  that  the  desired  GOS  for  voice  calls  can  be  expressed  in  the 
form  of  a  stringent  upper  bound  on  the  end-to-end  delay  of  packets  belonging  to  a  voice  call.  Thus 
we  deal  with  the  problem  of  serving  delay-sensitive  traffic  in  the  following  sense:  packets  cannot 
be  queued  for  more  than  n  time  slots,  where  n  is  fixed.  Thus,  upon  arrival,  a  voice  call  is  either 
accepted  if  this  constraint  can  be  satisfied,  or  it  is  blocked.  Blocked  ceiUs  are  assvuned  to  be  lost 
from  the  system,  a  mode  of  operation  known  as  ‘blocked  calls  cleared’  The  objective  then  is  to 
determine  the  polling  policy  that  minimizes  the  probabiUty  that  a  voice  call  is  blocked. 

For  RN’s  with  data  traffic,  as  discussed  in  [3],  we  consider  a  non-cycfic  polling  policy,  in 
particular,  a  non-work  conserving  probabilistic  policy  with  zero  switchover  times,  where  the  current 
time  slot  is  assigned  to  transmission  set  i  with  probability  6,.  We  refer  to  this  pohcy  as  ‘random 
polling’.  The  6i’s  then  represent  control  parameters,  which  can  be  gradually  tuned  or  optimally 
selected  so  as  to  achieve  a  desired  system  performance  In  scheduhng  delay  sensitive  voice  traffic 
we  consider  a  cyclic  policy,  where  the  polling  cycle  is  specified  by  a  frame  comprised  of  n  slots,  such 
that  a  slot  is  uniquely  assigned  to  a  particular  transmission  set.  Thus,  a  frame  is  characterized  by 
a  vector  [ci,  •  •  •,c„]  where  Ci  6  {1,  ■  •  -  ,  M}  is  the  transmission  set  assigned  to  slot  i  and  M  =  \C\ 
where  |  •  |  is  the  cardinahty  of  a  set.  Note  that  the  frame  structure  allows  us  to  guarantee  an  upper 
bound  on  the  intervisit  time  to  any  transmission  set.  In  the  case  of  delay-sensitive  traffic,  we  can 
therefore  ensure  that  no  transmission  set  ever  waits  for  more  than  n  time  slots. 

In  contrast  to  random  polling,  the  scheduhng  problem  under  cyclic  polling  is  formulated  as  a 
stochastic  discrete  optimization  problem  over  a  finite  set ,  and  is  an  instance  of  the  general  stochastic 

policy  is  work  conserving  or  nonidling  if  the  server  is  assigned  to  a  transmission  set  if  and  only  if  there  are 
customers  present  in  one  of  the  arrival  streams  belonging  to  that  transmission  set. 
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Figure  la :  4-Node  RN 
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Figure  lb:  Multiclass  Polling  System 


resource  allocation  problem  where  n  resources  (slots)  are  to  be  assigned  to  M  users  (transmission 

sets).  Rather  than  solve  the  original  discrete  optimization  problem,  which  can  easily  become 

combinatorially  explosive,  we  propose  a  new  approach  [6]  whereby  we  transform  the  former  into  an 
auxiliary  continuous  optimization  problem,  which  in  turn  can  be  solved  quite  efficiently  through 
various  gradient-based  algorithms.  Finally,  the  choice  of  a  state-independent  scheduling  policy  is 
motivated  by  the  simplicity  of  the  controller,  i.e.,  the  fact  that  no  monitoring  of  state  information 
(queue  lengths)  is  involved.  In  fact,  in  many  complex  systems,  including  radio  networks,  it  is  often 
the  case  that  state  information  is  either  impossible  to  obtain  or  it  may  not  be  up-to-date  for  a 
real-time  controller  to  use. 

The  remainder  of  the  report  is  organized  as  follows.  In  section  2  we  present  a  formulation  of 
the  transmission  scheduling  problem  with  overlapping  transmission  sets  in  the  presence  of  data  traif- 
fic.  This  formulation  reveals  the  need  for  estimates  of  the  sensitivities  (gradient)  of  the  performance 
measure  with  respect  to  the  slot  assignment  probability  pcirameters  0,-,  based  on  which  simple  on¬ 
line  optimization  algorithms  are  proposed.  We  present  one  such  estimator  for  the  case  where  the 
performcince  measure  is  the  mean  packet  waiting  time.  In  section  3,  we  present  a  gradient-based 
optimization  algorithm  for  obtaining  optimal  slot  assignment  probabibties  using  our  approach  for 
several  networks,  including  examples  illustrating  the  adaptive  features  of  the  approach.  In  section 
4,  our  aim  is  to  use  our  approach  to  subsequently  obtain  a  deterministic  (non-randomized)  sche¬ 
duling  policy,  thus  avoiding  the  randomization  in  the  slot  assignment  process.  We  develop  such  a 
deterministic  policy  using  the  Golden  Ratio  policy  proposed  in  [15],  and  compcire  it  with  several 
other  policies,  including  a  simple  Round  Robin  one.  In  section  5  we  consider  the  problem  of  serving 
delay-sensitive  voice  traffic.  In  section  6  we  formulate  the  optimal  scheduling  problem  as  a  stocha¬ 
stic  resource  allocation  problem,  and  describe  how  we  transform  the  ensuing  discrete  optimization 
problem  to  a  continuous  optimization  problem.  This  formulation  reveals  the  need  for  estimates  of 
the  sensitivity  (gradient)  of  performance  metrics  with  respect  to  the  control  parameters.  In  section 
7  we  derive  such  estimators  based  on  the  ‘marked  slot’  approach  proposed  in  [4],  and  develop  ex¬ 
plicit  algorithms  for  gradient  estimators  of  blocking  probability  under  two  modes  of  delay-sensitive 
operation.  In  section  8  we  present  a  convergence  result  which  delineates  the  relationship  between 
the  discrete  optimization  problem  and  the  auxiliary  continuous  optimization  problem.  In  sections 
9  and  10,  we  illustrate  our  methodology  by  discussing  two  applications  which  can  be  modelled  as 
resource  allocation  problems.  Lastly  in  section  11  we  discuss  ongoing  research  problems. 

2  Transmission  Scheduling  Problem  Formulation  with  Overlap¬ 
ping  Transmission  Sets 

Consider  a  iV-node  RN  with  data  traffic.  At  each  node  i,  i  =  1,...,^,  there  is  a  process  charac¬ 
terizing  data  arrivals,  where,  as  before,  we  assume  an  arbitrary  interarrival  distribution  for  data 
packets  and  allow  for  correlations  between  arrivals.  From  a  terminology  standpoint,  we  will  use 
“node”  and  “arrival  stream”  interchangeably.  Our  starting  point  is  to  assume  that  a  finite  munber 
of  transmission  sets  (M)  has  been  identified  such  that  aU  nodes  in  the  RN  belong  to  at  least  one 
such  set.  Moreover  let  there  be  one  or  more  overlapping  transmission  sets  (i.e.,  a  packet  from  some 
arrival  stream  (node)  may  be  served  in  transmission  set  i  or  j  ^  i).  Let  Zi  be  the  set  of  transmission 
sets  to  which  the  ith  arrival  stream  belongs.  For  example,  in  the  4-node  network  shown  in  Figure 
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la,  node  2  belongs  to  transmission  sets  1  and  2  so  that  —  {1,2}.  Similarly  Z3  =  {2,3}  and  so 
on. 

Let  us  focus  on  average  packet  waiting  time  as  the  cost  function  of  interest,  and  let  0;, 
z  =  1,  •  •  • ,  M  be  the  probability  that  a  slot  is  assigned  to  transmission  set  i  (assignments  are  made 
independently  from  one  slot  to  the  next).  We  define  0  to  be  the  M-dimensional  slot  assigimient 
probability  vector  [0i,  •  •  and  Wi(0)  as  the  expected  node  i  waiting  time.  Our  objective  is 

to  determine  the  optimal  slot  assignment  vector  so  as  to  minimize  a  weighted  sum  of  the  mean 
packet  waiting  times  associated  with  each  arrival  stream,  subject  to  normalization  and  stability 
constraints.  Thus,  the  optimization  problem  is  formulated  as: 

Problem  PI: 


N  M 

rmn  ^C'iW,(0)  s.t.  ^  =  1  (1) 

i=i 

where  Ci  is  the  weight  associated  with  stream  i.  In  the  above  problem  formulation  we  do  not 
explicitly  include  the  stability  constraint,  rather,  we  take  this  into  account  in  the  final  optimal 
assignment  by  simply  ensuring  that  every  adjustment  made  to  6i  never  violates  this  constraint. 
Observe  that  the  mean  waiting  2;de  Wi  is  a  function  of  the  probability  that  stream  i  is  assigned  a 
transmission  slot,  which  is  given  by  6j-  This  motivates  the  definition  of  a  new  JV-dimensioncil 

control  vector  §  with  elements 


<t>i=  i  =  !,•••,  iV 

ieZi 


(2) 


that  is,  <f>i  is  the  probability  that  a  time  slot  is  assigned  to  a  transmission  set  that  conteiins  stream 
i  (and  hence  a  packet  from  stream  i,  if  present,  is  served).  With  the  introduction  of  $,  we  can 
rewrite  PI  as 
Problem  P2: 

N 

imn s.t.  5i(f)  =  Ci,---,5„($)  =  c„  (3) 

where  we  replace  the  single  normalization  constreiint  by  one  or  more  constraints  on  the  auxiliary 
variables  which  we  obtain  from  the  N  equations  of  the  form  (2).  From  the  theory  of  Lagrangian 
optimization,  there  are  simple  Kuhn- Tucker  conditions  that  provide  necessary  conditions  for  op¬ 
timality.  If  a  Lagrange  mcdtiplier  is  associated  with  each  constraint  in  P2,  cifter  ebminating  the 
multipliers  we  obtain  conditions  which  are  typically  of  the  form 


h  ‘ 


K 


dWk{<l>k) 

d(pk 


(4) 


for  some  K,  L  <  N .  In  the  absence  of  analyticcd  expressions  for  Wi{4>i),  i  =  we  can 

use  standard  gradient-based  optimization  algorithms  (hiU-climbing  or  steepest-descent)  [21,  8]  to 
find  the  optimal  provided  the  gradient  in  (4)  is  available.  The  corresponding  0'  can  then  be 
obtained  as  a  solution  of  the  N  linear  equations  (2).  Alternatively,  since  the  (pi's  are  themselves 
variables  dependent  on  0,  we  may  choose  to  control  the  6i's:  thus  the  minimization  may  be  done 


5 


by  adjusting  0  directly;  an  example  will  be  given  in  the  next  section.  The  coupling  between  the 

transmission  sets  is  now  reflected  in  the  adjustments  of  ^,’s  so  as  to  achieve  the  derivative  balance 

required  by  (4). 

The  preceeding  discussion  motivates  us  to  seek  techniques  for  estimating  — by  simply 
observing  the  actual  system  in  operation  (or  by  simulating  it).  Along  these  Unes,  two  approaches 
which  have  been  proposed  are  Perturbation  Analysis  (PA)  (e.g.  [11,  14])  and  the  Likehhood  Ratio 
(LR)  methodology  (e.g.  [19, 12]).  In  this  report,  we  apply  PA  techniques  to  estimate  the  sensitivities 
in  question.  The  main  idea  in  PA  is  to  estimate  through  the  sample  derivative 

where  Li(^{(f)i,  u>))  is  &  sample  function  obtained  when  a  particular  sample  path  £(^i,  a;)  is  observed, 
and  Ji{<f>i)  =  £[T,(£((^i, a;))].  Here,  uv  is  an  element  of  the  underlying  probabihty  space  Q,  which 
is  taken  to  be  [0, 1]“;  thus,  a;  is  simply  viewed  as  a  sequence  of  uniformly  distributed  random 
variables  in  [0,1].  A  sample  derivative  obtained  through  this  approach  is  an  unbiased  estimate  of 
provided  that  [2] 

(5) 

Since  the  form  of  the  derivative  estimator  is  the  same  for  each  transmission  set,  let  us  concentrate 
on  an  isolated  transmission  set.  In  particular,  the  queueing  system  corresponding  to  transmission 
set  i  behaves  like  a  modified  G/D/1  queue  wdth  vacations,  where  vacations  correspond  to  periods 
when  the  server  is  imavailable  to  transmission  set  i  (i.e.  the  server  is  busy  serving  packets  from 
other  transmission  sets)  and  service  is  synchronized  to  coincide  with  the  start  of  a  slot.  That  is,  at 
the  start  of  every  interval  of  S  time  units  (i.e.,  each  slot),  the  server  is  available  to  transmission  set 
i  and  with  probability  (1  —  ^,)  the  server  goes  on  vacation.  A  slot  that  is  assigned  to  transmission 
set  i  is  termed  a  ‘transmission  slot’,  otherwise  it  is  termed  a  'vacation  slot’.  We  used  the  term 
modified  G/D/1  above,  because  an  arriving  customer  must  cdways  wait  until  the  end  of  the  slot 
within  which  it  arrives,  even  if  the  server  is  avciilable  (normally,  this  customer  would  immediately 
start  service). 

The  key  idea  on  which  the  estimator  is  based  is  to  evaluate  the  effect  of  altering  a  given 
schedule  (sequence  of  slot  assignments)  by  converting  one  vacation  slot  to  a  transmission  slot  on 
an  observed  sample  path.  The  vacation  slot  added  to  some  hypothetical  new  schedule  is  termed  a 
phantom  slot.  In  presenting  the  estimation  algorithm  we  define  the  following  sample  path  quantities 
(based  on  a  continuous-time  model): 

•  K  -  Observation  interval  in  number  of  busy  periods 

•  N  -  Number  of  node  i  packet  arrivals  during  observation  interval 

•  j  -  Arrival  epoch  of  the  ith  packet  in  the  jth  busy  period 

•  Tij  -  Time  instant  when  the  zth  transmission  slot  in  the  jth  busy  period  begins 

•  Vi  j  -  Time  instant  when  the  ith  vacation  slot  in  the  jth  busy  period  begins 

•  Vj  -  Number  of  vacation  slots  in  the  /th  busy  period. 

•  bj  -  Number  of  packets  in  the  jth  busy  period. 
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•  Pij  [Tij]  •  Packet  index  of  the  first  [Zasf]  packet  affected  by  the  removal  of  the  ith  vacation 
slot  in  the  jth  busy  period. 

Figure  2  illustrates  the  aforementioned  notation  where  vacation  slots  axe  shown  darkened, 
arrivals  are  represented  by  arrows  above  the  horizontal  time  axis,  and  departures  by  arrows  below 
the  time  axis.  In  Figure  2,  phantomizing  (i.e.,  removal)  of  the  2nd  vacation  slot  from  the  mth 
busy  period  introduces  a  service  slot  between  the  1st  and  2nd  original  service  slots  and  causes  all 
remaining  customers  in  the  busy  period  to  begin  service  one  service  epoch  earher.  Thus,  the  first 
customer  affected  by  this  additional  service  slot  is  the  second  arrival  i.e.,  fi2m.  =  2.  On  the  other 
hand,  the  last  customer  to  be  affected  is  72m  =  3.  Finally,  as  a  consequence  of  the  phantom  slot, 
the  busy  period  in  the  nominal  sample  path  has  been  spht  into  two  busy  periods  in  the  perturbed 
sample  path. 

We  now  state  the  following  theorem  (the  proof  can  be  found  in  [7]): 


[Vij  -  r,,,.,-] 

m  =  1 ,  •  •  • ,  6  j  —  1 

m  =  fiij,  •  •  • ,  6,- 

and,  by  convention,  =  Aij+i 

Observe  that  the  first  double  summation  (over  the  Vij  terms)  is  simply  the  sum  of  the  vacation 
slot  epochs,  and  in  addition  all  remmning  quantities  are  directly  measurable  from  the  sample  path. 
An  algorithm  for  implementing  an  estimator  based  on  Theorem  1  can  be  foimd  in  the  appendix  of 
[7].  From  an  implementation  viewpoint  memory /computational  overhead  is  minimized  by  noting 
that  the  majority  of  the  sample-path  quantities  (in  particular  Tij  eind  Vij)  need  to  be  stored  only 
for  the  duration  of  a  busy  period. 

Finally,  note  that  an  estimator  based  on  the  dual  process  of  converting  a  transmission  slot  into 
a  vacation  slot  can  also  be  similarly  developed.  We  refer  to  transmission  slots  to  be  removed  from 
the  nominal  system  sample  path  as  marked  slots.  Although  marking  (i.e.,  converting  a  transmission 
slot  into  a  vacation  slot)  Ccin  be  viewed  as  the  dual  process  of  phantomizing,  there  arise  some 
fundamental  differences  between  the  Phantom  Slot  and  Marked  Slot  estimator.  Most  notably,  in 
the  Phantom  Slot  estimator,  perturbational  effects  are  locahzed  to  the  busy  period  within  which 
they  originate;  as  a  result,  implementation  and  analysis  can  be  carried  out  for  every  busy  period  in 
isolation.  Alternatively,  in  the  Marked  Slot  estimator,  the  result  of  a  perturbation  due  to  marking 
a  service  slot  in  some  busy  period  can  propagate  to  subsequent  busy  periods  and  thus  busy  periods 
in  the  perturbed  sample  path  may  coalesce. 


Theorem  1  An  unbiased  estimate  of  is  given  by: 


'dWi' 


j  K 

fiij  =  argm^{m  :  Tm,j  <  Vij)  +  1 
=  ar^minlm  :  <  Tij) 


where 
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Figure  2a  -  Nominal  Sample  Path 


BP#m 


BP#(m+l) 


Figure  2b  -  (2,m)  Phantom  Slot  Sample  Path 
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3  A  Schedule  Optimization  Algorithm 

Now  that  we  have  at  our  disposal  the  derivative  estimators  required  for  solving  the  optimization 
problem  (P2),  we  may  proceed  to  develop  a  gradient-based  algorithm  for  obtaining  the  optimal 
schedule  (i.e.,  the  optimal  slot  assignment  probability  vector  0*  =  [0^^ . . .  ^  for  a  system  with 
M  transmission  sets).  In  what  follows,  we  limit  ourselves  to  one  such  algorithm,  similar  to  one 
used  in  [17],  simply  to  illustrate  our  approach.  Looking  at  (4),  we  see  that  our  cdgorithm  is  such 
that  it  seeks  to  balance  sums  of  derivatives.  The  main  idea  in  our  algorithm  is  the  following:  at 
every  iteration,  find  the  maximum  imbalance  condition  and  adjust  the  variables  6i  so  as  to  reduce 
it. 

We  will  proceed  by  considering  two  examples,  one  for  the  simple  case  where  C  =  and  one 
for  the  general  case  C  ^  A.  Finally,  it  should  be  clear  that  the  sample-path-based  nature  of  the 
derivative  estimators  on  which  our  optimization  approach  relies  possesses  some  inherent  “adaptive” 
properties  in  the  following  sense:  if  the  characteristics  of  various  arrival  processes  change,  then  the 
derivative  estimate  also  adjusts  itself  to  reflect  such  a  chainge  from  the  observed  data.  The  third 
example  in  this  section  is  intended  to  illustrate  these  adaptive  properties  by  allowing  the  topology 
of  a  system  to  change.  For  brevity  we  focus  on  the  problem  formulation  and  construction  of  the 
optimization  schemes;  simulation  resiilts  and  detciiled  discussions  can  be  foimd  in  [7]. 

1.  A  Two-Node  RN  with  C  =  A.  In  this  example,  we  consider  the  transmission 
scheduling  problem  with  two  arrival  streams  and  non-overlapping  transmission  sets.  Our  objective 
is  to  determine  the  optimal  slot  assignment  probabilities  so  as  to  minimize  the  mean  packet  waiting 
time.  For  this  simpUfied  case,  the  optimization  problem  is  reduced  to  balancing  the  derivatives  of 
the  individual  packet  waiting  times.  Let  r*  be  the  kth  observation  (iteration)  interval  over  which 
we  estimate  the  derivative  dWi/dOi  and  0^*^  =  the  probabiHty  vector  over  this 

interval®.  At  the  end  of  the  fcth  interval  we  compare  individual  node  derivatives  and  set  = 
cU-gmin^fdWj/d^i].  The  slot  assignment  probabilities  are  adjusted  so  as  to  increase  the  probability 
associated  with  i*  (and  hence  reduce  the  probability  associated  with  the  other  arrival  stream).  In 
this  simple  example,  we  need  only  control  a  single  probability.  In  particular,  consider  the  control 
of  6i ,  and  let  the  amount  of  change  for  the  Ath  interval  be  6k  where 

-  dW-i.  dW2 

(k\  ^  ^k 

de[^^  de['‘> 

and  T]k  is  a  “step  size”  parameter®.  At  the  end  of  the  feth  observation  interval  we  update  the  node 
1  probability  as  follows: 

[  «S‘>  +  4  if1Si>^and  (i5feT<l) 

if 

.  otherwise 

^When  C  =  A  there  is  only  one  arrival  stream  in  each  transmission  set. 

^For  this  simple  network  9i  is  the  probability  that  a  transmission  slot  is  assigned  to  arrival  stream  i. 

®The  step  size  sequence  7}k  must  be  properly  chosen  to  guarantee  convergence  [8].  Intuitively,  use  of  an  excessively 
small  step  size  will  result  in  slow  convergence,  whereas  use  of  an  excessively  large  step  size  can  result  in  oscillations 
and  thus  lack  of  convergence. 


9 


where  a  decrease  in  node  1  or  node  2  probability  is  permitted  if  and  only  if  at  the  new  operating 
point  we  do  not  violate  the  stability  requirement.  Finally,  we  assume  a  fixed  step  T]k  =  0.001 

and  adjust  the  observation  interval  according  to  r*  =  kLo  with  Lq  =  100  node  1  busy  periods.  The 
convergence  of  such  an  optimization  scheme  has  been  shown  in  [21], 

2.  A  Four-Node  RN  with  C  ^  A.  Consider  the  4-node  RN  shown  in  Figure  la  where 
the  objective  is  to  minimize  the  average  packet  waiting  time  in  the  network.  As  was  discussed 
in  the  introduction,  we  can  translate  this  broadcast  scheduling  problem  into  a  polling  model  with 
3  overlapping  transmission  sets.  Using  the  notation  introduced  in  section  2,  we  have  —  {!}, 
Z2  =  {1,2},  Zz  =  {2,3},  Z4  =  {3}.  The  optimization  problem  can  be  formulated  as: 

4  3 

min  s.t.  ^  =  1  (6) 


where  0  =  and  /i(A,)  is  the  fraction  of  arrivals  to  the  entire  network  that  arrive  at 

node  i  (typically  a  fimction  of  node  arrival  rate)^.  Introducing  the  new  variables  (pi,  i  =  1,  •  •  -  ,4, 
observe  from  the  specification  of  the  Zi  sets  that  =  ^1,  ^2  =  ^1  +  ^2)  <^3  =  ^2  +  ^3,  and  (p4  =  63. 
Then,  based  on  these  variables  and  the  normahzation  condition,  we  can  reformulate  the  problem 


<^1 + ^3  - 1 
^2 + ^4 = 1 


Using  Lagrangian  relaxation  and  ehminating  Lagrange  multiphers,  we  immediately  obtaiin  the 
following  necessciry  conditions  for  an  optimcd  point: 

^W^  dW-K 

Awy  =  (T) 

(8) 

£7^2 

The  optimization  problem  is  then  reduced  to  independently  balancing  two  pairs  of  derivatives. 
We  proceed  as  before  by  adopting  a  scheme  so  that  at  each  iteration  we  minimize  the  maximum 
difference  in  the  pairs  of  derivatives  involved.  Recall  that  the  minimization  is  over  the  slot  assign¬ 
ment  probabihties,  and,  as  before,  let  0^^^  be  the  slot  assignment  probabihty  vector  over  the  kth 
observation  interval,  where  the  duration  of  the  latter  is  transmission  set  1  busy  periods.  Over 
this  interval  each  node  i  estimates  its  derivative  At  the  end  of  the  interval,  there  are  two 


Case  1:  If  /i' 


then  we  update  the  slot  assignment  proba- 


^To  simplify  notation,  we  omit  the  dependence  on  A;  when  there  is  no  ambiguity. 
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bilities  as  follows: 


0(fc+i)  _ 


with 


0^  _  A'';  4*^^  +  A'' 

-  A*=;  4^^  + 

4^^  _  A*:.  /)(*') 


if  f/i 
if  f/i 

A'';  4''^  +  AV2;  4*^  +  A*^/2  if  (/i 


A*=;  4*^^  +  A* 


if  /i 


S^-i 

01^1 

aw^ 

d4>i 

ewi 

dtpi 


f  §31 

.  f  §31 

/s  3,^3 

/  f  §y3i 

^  -fs  9,^3 


/2  9^2 

and  Ifz 


'  dWr  dW^ 

r  d(j>i  dh 


X  Vk 


where  T]k  is  the  step  size  parameter.  To  ensure  convergence,  the  step  size  sequence  must  be  chosen 
to  satisfy  certain  standard  conditions  (e.g.,  [8]).  One  family  of  sequences  which  satisfies  these 
conditions  is 


Vk  = 


MkP 


where  M  is  a  normahzation  constant  and  0  <  p  <  1.  Finally,  note  that  in  (9)  only  components 
of  0  that  change  at  the  (k  +  l)th  iteration  are  explicitly  shown;  aU  remaining  components  are  the 
same  as  in  the  prior  iteration. 


Case  2:  If 


f  _  f 


00: 


f  §31 
\Ji  9<^ 


_  f  §31 

0<^4 


then; 


0(^+1)  = 


with 


4''^  -  A'' 
4^^  -  A*= 
4*“^  -  A*' 

-  A'' 


4^“^  +  A* 

4*“^  +  A''/2;  4*^^  +  A*/2 
+  A*= 

4*) 


FA* 


if 

if 

if 

if 


aw, 
0^1 
aw, 

d<S>i 

aw, 

d<i>i 

r  f  aw, 

1  •'1  01^1 


/r 

r 

/i- 


f  §31 

^  /s  9,^3 
^  /s  9^3 


>  fa 


aw, 

a<t>3 

aw^ 


-fs  903 


and 

and 

and 

and 


(f  §31 

\J^  a(t>2 

( f  0Wi 

l/2  902 

aW'i 

a<h 

awi 

a<h 


\h 


>^Sf 

<f^Wt 


(10) 


A* 


51^2  ,  dW4  1 


Finally,  the  length  of  the  observation  interval  is  controlled  as  follows: 


Tfc  =  kLo 


(11) 


where  Lq  is  an  a  priori  chosen  number  of  busy  periods.  The  aforementioned  optimization  procedure 
incorporates  both  step  size  as  well  as  sample  size  control®  [21].  Simulation  results  under  asymmetric 
traffic  loads  can  be  found  in  [7]. 

®An  alternative  to  controlling  the  step  size  sequence  is  to  increase  the  estimation/observation  horizon  at  each 
iteration.  This  is  referred  to  as  “sample  size  control.”  The  convergence  of  such  a  scheme  has  been  formally  shown  in 
[21].  An  informal  argument  is  as  follows:  If  the  estimators  are  consistent  (which  we  conjecture  to  be  the  case),  then 
as  the  observation  interval  increases,  the  sample  derivative  converges  to  the  derivative  of  the  expected  waiting  time. 
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3.  Adaptivity  properties  when  topology  changes.  In  this  example,  we  illustrate  the 
adaptivity  of  our  algorithm  subject  to  a  change  in  the  topology  of  the  RN  considered  in  the  last 
example.  In  particular,  consider  once  again  the  4-node  RN  in  Figure  la  and  assume  that  a  new 
node  wishes  to  participate  in  the  transmission  process.  Moreover  the  “incoming”  node  (denoted 
as  node  5)  is  located  so  that  it  is  within  the  broadcast  zone  of  nodes  1  and  2,  i.e.,  node  5  is  a 
neighbor  of  both  nodes  1  and  2.  The  resultant  topology  is  shown  in  Figure  3a  by  the  inclusion  of 
the  dotted  hnes.  The  effect  of  the  change  in  topology  is  to  alter  the  transmission  sets;  the  new 
transmission  sets  are  now  {(1,2),  (2, 3),  (3, 4,5)}  and  the  resultant  polling  model  is  shown  in  Figure 
3b.  The  change  in  transmission  sets  gives  rise  to  modified  optimahty  equations,  compared  to  the 
last  example.  In  particular,  the  new  Kuhn- Tucker  optimahty  conditions  are: 


/i(Ai) 

hi.^7) 


dWi 

d4>i 

dW^ 


-  h{^z) 

—  74(^4) 


dWs 

d(f>3 

dW. 


+  fsi^s) 


dWs 


(12) 

(13) 


d4>2  '  '  d4>4  d(l>s 

A  similar  optimization  scheme  as  that  of  example  2  can  be  now  constructed  to  achieve  the  desired 
derivative  balance.  Again,  a  discussion  of  the  simulation  results  can  be  found  in  [7]. 


4  A  Deterministic  Scheduling  Policy 


In  practice,  a  randomized  scheduhng  poUcy  may  not  be  desirable  either  because  it  involves  a  random 
number  generation  process  or  because  it  may  be  required  that  the  time  between  successive  server 
visits  to  a  transmission  set  be  bounded  by  a  deterministic  constant.  In  this  section,  we  use  our 
approach  for  obtaining  optimal  slot  assignment  probabihties  in  conjunction  with  the  Golden  Ratio 
(GR)  pohcy  presented  in  [15]  to  identify  a  deterministic  scheduhng  pohcy  with  certain  optimahty 
properties.  Our  contribution  is  therefore  to  optimize  the  Golden  Ratio  policy.  We  then  provide 
simulation  results  aimed  at  comparing  this  pohcy  to  several  others,  including  our  random  polhng 
approach  developed  in  the  previous  sections.  Note  that  in  specifying  a  deterministic  pohcy,  we  are 
seeking  the  specification  of  a  scheduhng  frame:  a  frame  consists  of  a  fixed  number  F  of  time  slots 
with  each  slot  allocated  to  a  given  transmission  set.  The  frame  repeats  itself  in  time,  thus  defining 
a  periodic  structure  through  which  the  server  is  assigned  to  each  transmission  set.  The  number  F 
of  slots  in  the  frame  is  chosen  according  to  a  desired  upper  bound  on  transmission  set  intervisit 
times.  The  problem  we  address  here  is  how  to  aUocate  the  F  slots  to  the  M  transmission  sets. 


The  Golden  Ratio  Scheduling  Policy.  Let  us  briefly  summarize  this  pohcy  as  presented 
in  [15].  As  before  consider  M  transmission  sets  and  let  6i  be  the  desirable  fraction  of  slots  to  be 
assigned  to  transmission  set  i.  Let  F^  —  be  the  kth  Fibonacci  number,  where  (p~^  = 


(^/(5)  -  l)/2  %  0.618034.  Then,  let  Nl,  i 

<  Nl  <  [0.Ffcl  and 


v/(5) 

1,  •  •  • ,  M  be  integers  such  that: 


M 


Fk 


(14) 


>■=1 


with  =  0.  Thus 


lim  =  di 

k-^00 
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node  3  node  1  node  4  node  2 

Figure  3a:  RN  Subject  to  a  Topology  Change 


Switching  is  synchronized 
with  slot  allocation 


Figure  3b:  Modified  Multiclass  Polling  System 
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For  each  k  the  GR  policy  assigns  N]^  slots  to  transmission  set  i  and  attempts  to  distribute  the 
allocation  of  slots  uniformly  over  a  frame  of  size  Fk.  Note  that  although  in  the  limit  k  -*  oo  visits 
to  transmission  set  i  axe  equally  spaced,  because  of  the  generally  overlapping  trcinsmission  set,  visits 
to  a  particular  node  need  not  be  equally  spaced. 

Let  frac[x)  =  x  -  [x],  aj  =  frac{jip~^)  and  =  {^j\j  =  0>  •  •  •  >  Ffc  -  1},  where  the 
nth  (1  <  n  <  Fk)  smallest  element  of  Ap^,  is  associated  with  the  nth  slot  in  the  frame.  The  final 
step  in  the  GR  policy  is  then  one  which  assigns  to  class  i  the  following  slots  within  a  frame: 


E  <  i  <  E 


m=l 


(15) 


Note  that  k  determines  the  desired  length  of  the  scheduling  frame,  which  we  assume  is  given.  Our 
task  is  to  specify  the  optimal  S,-,  i  =  Once  this  is  done,  one  can  use  (14)  and  (15)  to 

allocate  slots  in  the  frame.  Finally,  observe  that  if  all  0,  are  equal  then  GR  reverts  to  the  simple 
Round  Robin  policy  discussed  next.  We  illustrate  the  construction  of  the  GR  policy  through  the 
following  example. 

Example:  To  illustrate  the  procedure  consider  the  GR  policy  corresponding  to  the  optimal 
slot  allocation  probabilities  6i  =  0.32;  02  =  0.18  and  03  =  0.50  .  Let  k  =  7  so  that  we  define  a 
frame  of  length  F7  =  13,  and,  using  these  values  of  0i,  02,  03  in  (14),  we  get  =  4,^7  =  3 
and  Nj  =  6.  From  (15)  the  slots  corresponding  to  {0,  frac{2(p~^),  frac{Z(p~^)}  are  assigned  to 

class  1;  the  slots  corresponding  to  {frac{4ip~^),  •  •  •,  frac{6(p~^)}  are  assigned  to  class  2  and,  lastly, 
the  slots  corresponding  to  {frac{7(p~^),  •  ■  • ,  frac{12ip~^)}  are  assigned  to  class  3.  Consequently  we 
can  write  A13  =  {0,  frac{2(p~^),  ■  ■  • ,  frac{l2(p~^)}.  Evaluating  the  individual  terms  in  A13  and 

rearranging  then  in  an  increasing  order  (recalling  that  the  nth  smaUest  element  corresponds  to  the 
nth  slot)  we  obtain  the  following  polling  sequence:  1-^2— >3— >1— >3— >3— »2— +3— >1— >2— » 
3  ^  1  3. 

The  Round  Robin  Policy  In  the  Roimd  Robin  (RR)  policy,  assuming  M  transmission 
sets,  slots  cire  assigned  to  each  transmission  set  according  to  the  following  simple  sequence:  1  — > 
2-+...— >M-+1— >2— >•••— If  we  define  a  frame  of  length  M  slots,  then  RR 
assigns  each  transmission  set  exactly  1  slot  in  every  M  slots.  RR  scheduling  policies  in  the  context 
of  polling  system  have  been  well  researched  and  have  been  shown  to  be  optimal  (over  the  class 
of  all  admissible  policies)  for  systems  with  symmetric  traffic  and  non-overlapping  transmission 
sets  [22,  15].  We  conjecture  a  similar  result  for  overlapping  transmission  sets.  In  particular  the 
superiority  of  RR  over  random  polling,  under  symmetric  traffic,  can  be  attributed  to  the  higher 
variance  of  the  intervisit  time  (time,  in  slots,  between  successive  visits  to  a  transmission  set)  in 
random  polling.  However  as  we  deviate  from  symmetric  traffic  conditions,  RR  rapidly  deteriorates 
in  performance,  whereas  random  polling  is  fairly  ‘robust.’  The  last  observation  is  true  since  rsindom 
polling  will  adapt  to  the  changing  traffic  and  allocate  slots  to  reflect  the  change.  The  optimized 
GR  policy  may  be  thought  of  as  an  effort  to  combine  the  variance  properties  of  the  frame  structure 
with  the  adaptive  nature  of  random  polbng  in  order  to  design  a  frame  which  reflects, the  optimal 
slot  allocation  probabilities. 

The  Maximal  Traffic  Scheduling  Policy  The  Maximal  Traffic  Scheduling  Policy  [MTP) 
is  a  variant  to  the  optimal  GR  policy  where,  rather  than  using  the  optimal  Ofs  to  construct  the 
frame,  we  construct  a  frame  such  that  the  frequency  of  visits  to  a  particular  transmission  set  is 
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a  reflection  of  the  ‘maximal  traffic’  at  that  transmission  set.  Equivalently  MTP  assigns  priorities 
based  on  the  maximum  arrived  rate  to  a  transmission  set.  For  example,  if  in  Figure  lb  we  let  the 
arrival  rates  be  Ai  =  0.1,  A2  =  0.2,  A3  =  0.3,  A4  =  0.2,  then  the  maximed  traffic  at  transmission 
set  1  is  max(Ai,A2)  =  0.2;  similarly  for  transmission  sets  2  and  3  the  maximal  traffic  is  0.3. 
Consequently  the  desirable  fraction  of  slots  to  be  assigned  to  transmission  sets  1,2  and  3  is  |,| 
and  I  respectively.  Using  the  same  procedure  as  for  the  GR  (see  (15))  the  MTP  polling  sequence 
is:  l->3^2->3-»2^1->3^2. 

MTP  is  an  inherently  conservative  policy  in  that  it  ignores  the  overlapping  structure  and 
allocates  slots  assuming  that  if  an  arrival  stream  belongs  to  a  particidar  transmission  set,  then 
all  packets  from  that  stream  are  served  in  that  transmission  set.  The  efficiency  of  MTP  is  con¬ 
tingent  on  the  arrival  rates  of  overlapping  arrival  streams  (streams  that  belong  to  more  than  one 
transmission  set).  For  example,  if  the  maximal  arrival  rate  in  each  transmission  set  corresponds 
to  a  non- overlapping  stream,  then  MTP  incurs  no  penalty  by  ignoring  the  overlapping  structme. 
Alternatively,  if  the  maximal  rate  corresponds  to  an  overlapping  stream,  then  MTP  assigns  surplus 
slots  to  that  transmission  set  at  the  expense  of  increasing  the  mean  intervisit  time  for  streams  not 
belonging  to  that  set.  Finally,  if  the  maximal  arrival  rate  is  the  same  for  each  transmission  set  then 
MTP  reverts  to  RR. 

A  comparison  of  random  polling  (as  developed  in  previous  sections),  RR,  optimal  GR  (where 
the  optimization  is  based  on  the  slot  assignment  probability  vector  0*  determined  through  our  ap¬ 
proach),  and  MTP  is  presented  in  [7]  for  various  system  parameter  (arrival  rates  and  distributions). 
The  conclusion  drawn  is  that  in  each  case  GR  performs  better  than  RR  or  random  polling;  however 
random  polling,  even  tmder  optimal  slot  assignments,  does  not  always  perform  better  than  RR. 
The  last  observation  is  attributed  to  the  higher  variance  of  the  intervisit  time  in  random  polling. 

5  Scheduling  in  Networks  with  Voice  Traffic 

For  the  remainder  of  the  report  we  focus  on  scheduling  voice  calls  in  an  A-node  network  with  (for 
simplicity)  N  non-overlapping  (distinct)  transmission  sets.  The  amalysis  readily  extends  to  the  case 
with  overlapping  transmission  sets.  In  this  model,  ‘transmission  set’  is  synonymous  with  ‘node’ 
and  thus,  for  brevity,  the  latter  terminology  is  adopted.  Recadl  that  voice  traffic  is  delay- sensitive 
traffic  in  the  following  sense:  voice  packets  cannot  be  queued  for  more  than  n  time  slots,  where  n 
is  fixed.  Thus,  upon  arrival,  a  voice  call  is  accepted  if  it  can  be  assigned  a  transmission  slot  within 
the  call  setup  time^  otherwise  it  is  blocked.  Once  a  call  is  accepted,  then  due  to  the  cychc  nature  of 
the  poUing  poficy,  we  can  guarantee  that  packets  belonging  to  this  call  are  assigned  a  transmission 
slot  every  n  time  slots.  We  consider  a  cyclic  policy,  where  the  poUing  cycle  is  specified  by  a  frame 
comprised  of  n  slots,  such  that  a  slot  is  uniquely  assigned  to  a  particular  transmission  set.  Thus,  a 
frame  is  characterized  by  a  vector  [cj ,  •  •  • ,  c^]  where  Ci  €  { 1 ,  •  •  ■ ,  A}  is  the  transmission  set  (node) 
assigned  to  slot  i.  Our  objective  then  is  to  determine  the  optimal  frame  that  minimizes  the  average 
call  blocking  probability. 

The  delay- sensitive  blocking  model  can  be  used  to  model  voice  call  processing  in  an  A-node 
multihop  RN  [10,  24].  When  processing  voice  traffic  with  circuit  switching,  a  voice  caU  arriving 

®In  this  report  the  call  setup  time  is  defined  as  the  maximum  duration  a  call  can  be  queued  before  a  decision  to 
accept/reject  the  call  is  made. 
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at  a  node  reserves  (for  the  duration  of  the  voice  call)  the  desired  bandwidth  at  each  node  along  a 
multihop  path.  If,  upon  arrival  of  a  voice  call,  bandwidth  is  unavailable  at  any  of  the  intermediate 
nodes  required,  the  call  is  blocked.  This  is  also  referred  to  as  ‘voice  admission  control’,  a  problem 
also  considered  in  [1]  using  a  model  not  based  on  slotted  time,  but  rather  frequency  division 
multiplexing  to  overcome  the  problem  of  channel  access  conflicts.  In  the  context  of  our  slotted-time 
model.  Time  Division  Multiplexing  (TDM)  is  employed  to  provide  contention-free  multiplexing  of 
the  bandwidth.  Moreover,  the  duration  of  a  time  slot  is  determined  by  the  bandwidth  requirement 
of  each  node  (for  simphcity,  we  assume  that  the  bandwidth  requirements  are  the  same  at  all  nodes). 

In  the  general  TDM  framework,  a  frame  consisting  of  n  time  slots  is  defined  so  that  each  slot 
is  assigned  to  a  voice  call  (or  several  voice  calls  belonging  to  the  same  transmission  set).  The  size 
of  the  frame  (i.e.,  the  number  of  slots  n)  is  chosen  so  that  proper  voice  trcinsnndssion  is  ensured. 
The  frame  is  repeated  in  time  so  that  the  controller  assigns  the  server  to  each  transmission  set  in 
accordance  with  the  frame  structure  Thus,  a  call  is  blocked  if  it  cannot  be  assigned  a  slot  in  the 
TDAf  frame.  The  design  of  the  frame  then  represents  a  means  of  controlling  the  performance  of  the 
system  in  terms  of  the  blocking  probability  of  czills.  In  the  remainder  of  this  section  we  describe 
the  call  arrival  process  in  more  detail,  and  define  two  modes  of  delay-sensitive  operation,  termed 
slot  delay  blocking  and  frame  delay  blocking. 

We  refrain  from  providing  a  specific  model  for  voice;  rather,  we  assume  that  sampling  auid 
encoding  are  such  that  at  most  one  packet  is  submitted  for  transmission  per  frame  by  any  individual 
call.  This  guarantees  that  if  a  call  is  accepted,  then  no  packet  from  that  cadi  will  be  queued  for 
service.  We  can  now  describe  our  model  as  follows.  At  each  node  z,  i  =  1, ...,  A,  there  is  a  process 
characterizing  voice  cadi  arrivals.  We  assume  an  airbitrairy  interarrival  distribution  for  calls,  and 
allow  for  correlations  between  call  arrivals.  The  jth  call  at  node  i  is  chairacterized  by  the  pair 
where  AJ  is  the  call  arrival  epoch  and  rjj  is  the  call  duration,  specified  as  a  number  of 
frames.  The  discrete  random  variable  ij'  has  an  arbitrary  distribution.  Equivalently  rjj  can  be 
viewed  as  the  number  of  packets  in  the  jth  node  i  call.  Note  that  A‘-  and  77*  do  not  have  be  known 
apriori.  The  decision  to  accept  or  reject  a  call  is  made  independent  of  the  call  duration,  and 
moreover,  once  the  call  is  accepted,  it  is  guaranteed  one  slot  per  frame  for  the  duration  of  the  call. 

Next,  we  describe  the  operation  of  the  system  from  node  i’s  point  of  view;  the  operation  is 
the  same  from  the  point  of  view  of  any  other  node.  The  server  is  assigned  to  nodes  based  on  a  frame 
consisting  of  n  slots  (as  already  mentioned,  n  depends  on  the  characteristics  of  the  delay- sensitive 
traffic  we  are  serving).  Each  slot  is  assigned  to  some  node.  Thus,  from  node  i ’s  perspective,  the 
frame  consists  of  slots  assigned  to  i  {transmission  slots)  and  slots  assigned  to  some  other  node 
{vacation  slots).  Suppose  a  frame  is  such  that  slots  are  assigned  to  node  i.  For  any  time  instant, 
let  fi,  0  ^  /t  tli6  number  of  trsmsmission  slots  not  committed  to  any  ongoing  node  i  call. 

Then,  when  a  call  {Aj,T]))  is  submitted  to  the  system,  the  call  is  admitted  if  fi  >  0,  in  which  case  /,• 
is  decremented  by  1;  otherwise,  the  call  is  blocked  and  considered  lost.  Moreover,  this  call  reserves 
the  slot  it  is  assigned  for  the  next  rj)  frames,  i.e.,  if  the  jth  caU  is  accepted  and  begins  using  the  kth 
slot  of  the  mth  frame  on  the  time  line,  then  it  reserves  the  kth  slot  for  the  {(m-h  1),  •  •  • ,  (m  +  T/*-  - 1)} 
frames.  These  slots  are  now  considered  unavailable  to  future  voice  calls  arriving  at  i. 

The  model  is  complete  once  we  specify  the  call  setup  time,  i.e.,  how  long  an  arriving  call 
can  wait  for  an  accept /reject  decision.  We  consider  two  variants  on  the  call  setup  time.  In  the 
first  variant,  henceforth  referred  to  as  Slot  Delay  (SD)  blocking,  an  arriving  caU  is  queued  imtil  the 
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beginning  of  the  next  slot  (vacation  or  transmission)  only,  resulting  in  a  call  set  up  of  one  time 
slot.  A  call  is  therefore  blocked  if  within  one  slot  duration  of  the  call  arrival  epoch  an  available 
transmission  slot  is  not  assigned  to  node  i  (recall  that  a  transmission  slot  is  deemed  available  if 
it  has  not  been  reserved  by  any  prior  voice  caU).  This  is  true  regardless  of  whether  there  are  einy 
transmission  slots  actually  available  in  the  frame.  If  the  caU  is  accepted,  it  proceeds  to  reserve  slots 
in  subsequent  frames  as  described  in  the  preceding  paragraph. 

Alternatively,  in  Frame  Delay  (FD)  blocking,  the  decision  to  accept  calls  is  made  at  the 
beginning  of  each  frame  (for  a  maximum  call  setup  of  2n  —  1  slot  durations^®);  all  calls  that  arrive 
during  a  frame  are  therefore  queued  up  to  the  beginning  of  the  next  frame.  At  that  time,  those 
calls  that  cannot  be  assigned  an  available  transmission  slot  are  blocked.  As  before,  if  caU  (A‘-,77*) 
is  accepted,  it  reserves  the  slot  it  is  assigned  for  77*-  frames.  Note  that  this  model  aUows  for  an 
arbitrary  selection  of  which  caUs  at  the  beginning  of  any  frame  wiU  be  blocked,  whereas  in  SD 
queuing  the  First-In-First-Out  property  is  always  preserved.  It  is  also  worth  pointing  out  that,  in 
FD  blocking,  if  there  are  fi>0  available  transmission  slots  when  a  caU  arrives,  the  precise  slot 
identity  in  the  frame  which  is  assigned  to  this  call  does  not  affect  the  performance  of  the  system 
in  terms  of  future  blocked  calls. 

6  Discrete  Optimization  Problem  Formulation 

We  now  formulate  the  scheduling  problem  as  an  optimal  slot  aUocation  problem.  Let  Cj  denote  the 
node  assigned  to  slot  j  in  a  frame  consisting  of  n  slots.  Then,  let  S  be  the  finite  set  of  feasible 
frames  where; 

5  =  {[ci,---,c„]  :  Cj- e  {I,---,  A}} 

By  “feasible”  we  mean  that  the  frame  must  be  chosen  to  satisfy  some  basic  requirements,  such  as 
that  at  least  one  slot  is  assigned  to  each  node.  For  any  s  G  5,  let  Ji{s)  —  A[Li(s)]  be  the  node  i 
cost  function,  where  as  before  Li(s)  is  the  sample  node  i  cost.  Then,  our  objective  is  to  select  s 
G  5  so  as  to  solve  the  foUowing  discrete  optimization  problem: 

Problem  P3: 

N 

t  =  l 

where  /3,-  is  the  weight  associated  with  node  i.  This  is  generaUy  a  difficult  problem,  as  it  involves 
both  optimization  over  a  discrete  set  and  estimation  of  the  total  cost  under  aU  possible  frames  s  G  5. 
Rather  than  attempting  to  solve  it,  we  transform  (P3)  into  a  continuous  parameter  optimization 
problem  as  foUows. 

6.1  Transforming  the  Discrete  Optimization  Problem  into  a  Continuous  Opti¬ 
mization  Problem 

Let  6i  =  P[cj  =  i],  independent  of  the  slot  index  j  =  l,...,n.  We  can  then  construct  a  frame 
through  a  randomization  mechanism  based  on  a  probabihty  vector  6  =  [61, ...,  0jv],  with  = 

*®The  maximum  call  setup  corresponds  to  a  voice  call  that  arrives  just  after  the  beginning  of  a  frame  and  is  assigned 
the  last  time  slot  in  the  subsequent  frame. 
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1,  whereby  each  slot  is  independently  assigned  to  node  i  with  probabiHty  6i.  In  this  setting,  6 
represents  the  control  parameter  which  can  be  optimcdly  selected  so  as  to  minimize  the  desired  cost 
function.  In  particular,  we  replace  problem  (P3)  by  the  following  constrained  continuous  parameter 
optimization  problem: 

Problem  P4: 


N 

mjn^/3,-.E[Z,,((9)] 

*  i=i 


s.t. 


N 

i=l 


The  relationship  between  the  slot  assigrunent  probabihty  vector  6  and  the  actual  frame  s  can  be 
made  precise  as  follows.  Let  r  =  [ri,  •  •  • ,  r„]  be  cin  n-dimensional  vector  of  independently  generated 
random  numbers  with  every  rj  uniformly  distributed  in  [0,1].  A  frame  s(0,r)  =  [ci,  •••,£„]  is 
obtained  using  the  standard  inverse  transform  technique  for  generating  random  variates  cj,  ...,£„ 
as  follows  [16]: 

i 

Cj  =  min{i  :  Tj  <Y^ek,  (16) 

ife=i 

It  is  then  clear  that,  for  a  given  r,  by  adjusting  the  parameters  6i,  i  =  we  can  ciffect  Cj, 

j  =  1,  and,  consequently,  control  the  design  of  the  frame.  From  a  notational  standpoint,  it 
will  be  useful  to  distinguish  between  a  frame  generated  through  (16),  which  is  denoted  by  s(^,r), 
and  a  frame  denoted  by  s,  which  is  simply  an  element  of  the  set  5. 

The  obvious  advantage  of  replacing  the  discrete  optimization  problem  (P3)  by  the  continuous 
optimization  problem  (P4)  is  the  fact  that  the  latter  may  be  solved  through  standard  nonlinear 
programming  techniques  (e.g.,  using  a  Lagrangian  relaxation  approach).  Moreover,  if  closed- form 
expressions  for  7,(-)  are  avadlable,  (P2)  is  reduced  to  solving  a  system  of  algebreiic  equations.  If, 
on  the  other  hand,  such  expressions  are  not  available,  then  we  can  still  resort  to  standard  gradient- 
based  optimization  algorithms  (hiU-chmbing  or  steepest  descent).  Thus  we  need  to  derive  estimators 
similar  to  those  developed  in  [7]  and  in  section  2. 

Since  our  goal  then  is  to  estimate  dE[Li{0)]/d6i  from  a  single  sample  path,  we  need  to 
generate  a  sample  path  of  RN  operation  in  a  stochastic  environment.  To  do  so,  we  need  to  specify 
a  frame  s{6,  r).  Assuming  that  a  control  vector  B  is  specified,  then  a  sample  path  is  characterized  by 
the  vector  r  and  we  write  in  (P4)  E[Li[6)]  =  F^[L,(s(0,  r))].  Consequently, we  see  that  the  solution 
of  (P4)  is  dependent  on  the  initial  selection  of  r,  which  is  subsequently  held  fixed  throughout  the 
optimization  procedure.  If  we  repeat  the  process  with  a  new  initial  r,  then,  in  general,  a  different 
optimal  point  6'  is  attained.  A  solution  to  (P4)  is  therefore  characterized  by  the  pair  {B^,rk)  where 
the  subscript  refers  to  the  A:th  optimization  run.  However,  we  claim  that,  the  frame  s{6'^,rk)  is 
fixed  for  cill  k,  and,  imder  certain  conditions,  it  is  also  the  solution  to  (P3). 

In  the  next  section  we  adopt  the  Marked  Slot  methodology  proposed  in  [4]  in  order  to  obtain 
an  unbiased  estimate  of  for  any  node  i  =  Note  that  Ji[  )  is  a  function  of  6i  only, 

i.e.,  the  fraction  of  calls  blocked  at  node  i  depends  only  on  the  fraction  of  slots  assigned  to  i.  We 
can  therefore  concentrate  on  an  isolated  node  i.  Recall  that  a  frame,  from  node  z’s  perspective,  is 
comprised  of  transmission  and  vacation  slots;  therefore  a  frame  s  =  [ci,  ...,c„]  can  be  represented 


7  Derivative  Estimation  through  the  Marked  Slot  Approach 

In  this  section  we  use  the  concept  of  marking  (as  defined  earlier;  see  section  2)  transmission  slots  in 
order  to  obtain  imbiased  derivative  estimates  based  on  observation  of  a  single  sample  path  of  our 
system.  The  fundamental  difference  between  marking  a  slot  in  a  purely  random  pohcy  as  opposed 
to  a  frame-based  one  is  that  in  the  latter  the  marking  of  a  slot  in  a  given  frsime  also  results  in  the 

marking  of  the  same  slot  in  every  subsequent  frame.  We  refer  to  the  j -marked  slot  system  as  the 

sample  path  generated  when  the  jth  transmission  slot  in  the  nominal  frame  is  marked.  Let  us  now 
concentrate  on  the  specific  performance  measure  of  interest  to  our  problem,  i.e.,  let  Ji{6i)  be  the 
expected  fraction  of  calls  blocked  at  node  i  after  K  call  arrivals,  which  we  will  denote  by  Pi.  Define 
the  following  sample  path  quantities  for  each  node  i: 

•  bi  [6i(j)]  -  Total  number  of  blocked  calls  at  node  i  in  the  nominal  [j-marked  slot]  system 

•  a,-(j)  -  Total  number  of  node  i  calls  accepted  in  the  jth  transmission  slot  of  any  frame. 

•  ai[k)  -  Number  of  call  cirrivals  at  node  i  during  the  kth  freime 

•  di{k)  -  Number  of  call  completions  at  node  i  during  the  kth  frame 

•  fi{k)  -  Number  of  transmission  slots  available  to  node  i  in  the  kth  frame  (i.e.,  slots  assigned 
to  i  which  are  not  currently  used  by  a  node  i  call) 

Then,  we  consider  the  sample  function  L{6i,u})  =  bifK^  with  Pi  =  E[bi]/K.  Note  that  we  limit 
ourselves  here  to  sample  paths  defined  by  K  call  arrivals  at  node  i.  Under  standard  ergodicity 
conditions,  we  expect  that  as  K  —>  oo,  L  converges  a.s.  to  the  steady-state  blocking  probability  at 
node  i.  With  the  notation  defined  above  we  have  the  following  theorem  (the  proof  can  be  found  in 

[5]): 
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Theorem  2  An  unbiased  estimate  of  ^  is  given  by: 


\fL  = 

In  summary,  (19)  requires  us  to  calculate  the  change  in  performance  when  the  jth  (for  all  j  G  A) 
transmission  slot  is  marked  in  the  nominal  frame.  In  the  remainder  of  this  section  we  present 
expressions  for  evaluating  this  change,  in  terms  of  quantities  directly  observable  on  the  nominal 
sample  path,  first  operating  under  SD  blocking  and  then  FD  blocking.  Again,  for  brevity,  we  will 
present  only  the  final  resvdt  and  omit  the  actual  derivation. 


7.1  Derivative  Estimation  with  Slot  Delay  Blocking 

Recall,  that  the  effect  of  marking  the  jth  slot  in  the  first  frame  is  to  mark  the  jth  slot  in  every 
subsequent  frame.  The  consequences  of  such  a  coupled  marking  is  that  all  calls  that  were  originally 
accepted  in  the  jth  transmission  slot  of  any  frame  in  the  nominal  system,  wiU  now  be  blocked 
in  the  j-marked  system.  Therefore,  the  change  in  number  of  node  i  blocked  calls  when  the  jth 
transmission  slot  is  marked  is  given  by  cti{j)  or  the  total  number  of  node  i  ceiUs  accepted  in  the  jth 
transmission  slot  of  any  frame.  Thus  we  have  the  following  result: 


Theorem  3  An  unbiased  estimate  of  is  given  by: 


.  tBt 


bj/K  -  1 


The  proof  can  be  foimd  in  [5]. 

Note  that  since  the  estimator  in  (20)  is  unbiased,  it  follows  that 

£i  ^  „  \bi/K  -  1  - 

d6i  6i 

The  first-order  differential  equation  in  (21)  can  be  solved  to  obtain  the  linear  equation  for  Pi: 

=  mdi  +  1  (22) 

where  m  is  the  slope,  an  unbiased  estimate  of  which  is  given  by  (20).  The  ramification  of  this 
result  is  that  (P4)  is  now  transformed  into  a  linear  prc>graming  problem  and  can  be  efficiently 
solved.  It  is  interesting  to  note  that  this  result  is  easily  derived  through  the  “perturbation-like” 
argument  given  above,  which  is  completely  independent  of  the  call  arrival  process  or  the  distribution 
characterizing  call  durations.  It  is  an  example  of  a  simple  saniplc-path-type  argument  that  can  be 
used  not  to  derive  explicit  derivative  estimates,  but  to  establish  a  functional  relationship  between 
a  performance  metric,  Pi,  and  a  parameter  of  interest,  6,. 


7.2  Derivative  Estimation  with  Frame  Delay  Blocking 

The  primary  difference  between  slot  marking  with  SD  and  FD  blocking  is  that,  in  the  latter, 
marking  a  transmission  slot  does  not  necessarily  imply  that  a  call,  if  accepted  in  the  marked  slot, 
will  be  blocked  in  the  perturbed  system.  Define  a  “tapped”  call  to  be  a  call  that  is  accepted  in 
the  nominal  but  not  in  some  j-marked  sample  path.  Then,  by  definition,  Abi{j)  is  given  by  the 
total  number  of  tagged  calls  over  the  observation  interval.  Therefore  if  we  can  monitor  the  tagging 
process  then  we  can  derive  the  appropriate  estimator.  Before  proceeding  further  we  define 

K 

where  l(-)  is  the  standard  indicator  function.  Thus,  Ui  is  simply  the  number  of  slots  allocated  to 
node  i  in  some  frame  s.  We  shall  now  make  the  following  simplifying  assumption  for  our  problem: 

•  Al:  Li{s)  depends  only  on  the  number  of  slots  assigned  to  node  i,  i.e.,  Li{s)  =  Li{ni). 

Clearly,  Al  is  justified  when  the  sample  fimction  is  the  blocking  probability  tmder  FD  blocking. 
However,  the  assumption  would  not  hold  in  cases  where  the  identity  of  the  slots  assigned  to  node 
i  affects  that  node’s  cost  function.  For  example,  if  the  sample  function  were  the  individual  node 
mean  waiting  time,  then  clearly  the  latter  is  a  function  of  the  position  of  the  transmission  slots 
within  the  frame,  and  not  just  of  the  number  of  such  transmission  slots.  Lastly,  note  that  as  a 
consequence  oi  A 1  there  can  be  at  most  one  tagged  call  in  the  nominal  system  at  any  instant. 

To  formalize  the  process  of  call  tagging,  we  define  the  binsiry  variable  Zi(fc),  where  we  use 
Zi{k)  =  1  to  denote  the  fact  that  there  is  a  tagged  call  at  the  beginning  of  the  kth.  frame.  The 
dynamics  of  the  tagging  process  are  then  given  by: 

1  if  [a, (A:)  >  fi{k)  +  di(A:)]  and 

[(z,(A:)  =  0)  or  (tagged  call  departs  in  frame  k)] 

0  if  (tagged  call  departs  in  frame  k)  and  [a, (A:)  <  fi{k)  +  di(A:)] 

Z{(A:)  otherwise 

with  initial  condition  2i(l)  =  0.  We  claim  that  (for  details  see  [5]): 

Theorem  4  An  unbiased  estimate  of  ^  is  given  by: 

F 

-jr  =  ^  /*(*^)  +  [(^t(^)  =  0)  or  (tagged  call  departs)] 

J  est 

where  F  is  the  number  of  frames  contained  in  a  sample  path  defined  by  K  call  arrivals  at  node  i. 

Finally,  a  reduced- variance  estimator  can  be  obtained  by  exploiting  the  fact  that  6i  =  E\A\/n. 
Thus,  the  estimator  we  finally  use  (which,  in  general,  will  no  longer  be  unbiased)  is 

F 

V  l[a,(A:)  >  fi{k)  +  cfi(A:)]l  [(z£(A:)  =  0)  or  (tagged  call  departs)]  (23) 
For  the  remainder  of  the  report  we  focus  on  the  optimization  problem  assuming  FD  blocking. 


z,(A:  -f  1)  = 
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8  Convergence  to  a  Global  Optimum 

Before  presenting  our  main  result,  let  us  summarize  our  approach  by  means  of  the  following  opti¬ 
mization  scheme  (Si).  For  brevity,  we  denote  the  estimate  in  (23)  by  Di. 

1.  Initially  select  some  r  and  Hence,  determine  an  initial  frame  =  s(0(°\r)  through 
(16). 

2.  Observe  a  sample  path  imder  and  estimate  the  derivatives  for  all  i  =  1,  •  •  • ,  iV. 

3.  For  any  m  =  0, 1,  •  •  •,  iterate  as  follows: 

^(m  +  l)  ^  (24) 

=  Tnin{i:rj  (25) 

fc=i 

with  some  provisions  to  ensure  that  =  1  foi'  all  =  0, 1,  •  • 

4.  Observe  a  sample  path  rmder  s{6^^\r)  to  estimate  the  derivatives  for  all  i  =  1,  •  •  ■ ,  IV, 
and  repeat  step  3  until  we  satisfy  stopping  condition. 

Of  course,  (24)  is  a  standard  stochastic  approximation  scheme  driven  by  the  derivative  esti¬ 
mates  DJ”  with  {J7m}  an  appropriately  selected  step  size  sequence.  There  are  several  important 
observations:  first  after  an  update  in  the  auxiliary  vector  the  frame  is  also  updated  using 

the  transformation  defined  in  (16).  Second,  the  iterative  scheme  is  terminated  when  we  reach  a 
stopping  condition,  and  not  necessarily  when  6^'s  have  converged.  In  particular,  the  stopping 
condition  is  defined  as  one  where 

max{£)j'"\  i  =  1,  •  •  • ,  N} 

is  minimized  over  the  iteration  index  m.  Finally,  note  that  r  remains  fixed  throughout  the  process. 
For  more  details  on  the  algorithm  the  reader  is  referred  to  [6]. 

In  addition,  we  make  the  following  assumption  regarding  the  cost  functions  of  interest: 

•  A2:  For  all  t  =  1,  •■,  N ,  Li{ni)  is  such  that  Li{ni  4-  1)  —  Li[ni)  >  Li{ni)  -  Li{ni  -  1). 

We  can  now  establish  the  following  result.  Let  us  first  consider  a  specific  sample  path  and  apply 
the  optimization  scheme  (Si).  Let  6’  be  the  probability  vector  when  the  stopping  condition  is  met 
in  (24),  and  let  s(0*,r)  be  the  corresponding  frame.  This  frame  is  an  optimal  one  in  the  discrete 
space  5,  i.e.,  s(0'',r)  =  s',  where  s“  is  independent  of  r  and  of  the  sequence  m  =  0,1,  ••• 

which  leads  to  the  stopping  point  in  (24). 

Theorem  5  Under  assumptions  A1-A2,  the  optimization  scheme  (S2)  applied  to  a  particular  sam¬ 
ple  path  of  the  underlying  system  yields  a  frame  which  is  optimal  in  the  discrete  space  S . 

The  proof  can  be  found  in  [6]. 

We  next  present  two  applications  of  this  approach  to  voice  call  processing  in  RNs.  We  first 
consider  the  optimal  frame  design  to  minimize  blocking  in  a  fully  connected  RN.  In  the  second 
application,  we  consider  the  optimal  admission  control  problem  in  a  multihop  RN. 


22 


9 


Optimal  TDM  Frame  Design 


This  application  is  motivated  by  the  need  to  schedule  voice  call  transmissions  (broadcast  mode) 
in  an  JV-node  Radio  Network  [24].  In  particular  consider  a  3-node  RN  and  for  simphcity,  assume 
that  all  nodes  are  neighbors  of  every  other  node  i.e.,  the  network  is  fuUy  connected.  The  stochastic 
discrete  optimization  problem  is  then  to  select  the  optimcil  frame  s*  to  minimize  X)i=i  A-ft(s)  where 
Pi{s)  is  the  node  i  blocking  probabihty  operating  under  FD  blocking. 

We  employ  a  gradient  based  optimization  algorithm  (see  [17]),  where  the  parameters  of  in¬ 
terest  are:  (a)  the  observation  interval  t*,  (specified  in  terms  of  the  number  of  calls  arrivals)  over 
which  the  derivatives  dPi/dOi  are  estimated,  and  (b)  the  step  size  rjk  which  is  used  to  control  the 
adjustment  at  the  A:th  update.  In  the  algorithm  considered  here  we  assume  a  constant  step  size 
77  =  1.0  and  increasing  observation  interval  lengths.  In  particular,  =  Tjt_i  -)-  C  with  Ti  =  1000  sind 
C  =  200  node  1  caU  arrivals.  At  the  end  of  the  I:th  observation  interval  we  compare  individual  node 
derivatives  and  set  i'  =  argmin,[dPi/d0i]  and  j"  =  argmaxi[dPi/d0,].  The  slot  assignment  proba- 
bihties  are  adjusted  so  as  to  increase  the  probability  associated  with  i*  and  reduce  the  probability 
associated  with  . 

In  the  simulation  results  shown  below,  we  assume  a  Tl  frame  with  n  =  24  slots,  and  a 
symmetrically  loaded  system  with  Poisson  arrivals  (A,-  =  0.2  V  i  =  1,  ••,3)  and  fixed  csdl  durations 
of  5  frames.  Thus,  even  though  this  system  is  difficrdt  to  analyze,  it  follows  from  the  symmetry 
that  the  optimal  frame  is  [8,8,8].  We  considered  three  single-run  optimization  experiments  (with 
different  randomization  vectors  ri,  r2,  ra).  In  each  case,  the  initial  slot  assignment  probability 
vector  is  =  [0.7,0.15,0.15].  Table  1  shows  the  initial  and  final  frames,  as  well  as  the  final  slot 
assignment  probabihties,  where  the  latter  two  are  indexed  by  i,  the  number  of  iterations  on  6  imtil 
we  first  reach  the  optimal  frame.  After  this  point  we  oscillate  between  the  optimal  and  one  of 
its  neighbors.  The  oscillatory  behavior  is  typical  of  most  gradient-based  iterative  schemes  and  in 
practice  it  is  easy  to  identify  this  situation  and  select  the  true  optimal  allocation. 


Exp. 

s(°) 

r 

X 

1 

(18,3,3) 

rs 

(8,8,8) 

[0.391,0.261,0.348] 

29 

2 

(20,3,1) 

ri 

(8,8,8)  1 

[0.292,0.197,0.511] 

30 

3 

(21,1,2) 

r2 

(8,8,8) 

[0.263,0.185,0.552] 

38 

Table  1:  Frame  Optimization  for  a  3-node  RN 


10  Optimal  Admission  Control  in  Blocking  Networks: 

In  this  apphcation  we  determine  the  optimal  admission  control  pohcy  to  minimize  blocking  in 
networks  with  fixed  routing  (specified  as  Source/Destination  pairs).  Each  S/D  pair  can  be  viewed 
as  a,  typically  multihop,  circuit  where  the  ith  such  circuit  is  denoted  by  c^.  We  consider  a  4-node 
network  with  5  S/D  pairs  (see  Figure  4a).  We  assume  homogenous  resources  at  each  node,  in 
particular  each  node  in  the  network  has  n  transceivers,  and  such  that  a  transceiver  can  be  used 
for  simultaneously  transmitting  and  receiving  a  packet.  In  tandem  queue  networks  (see  Figure 
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Figure  4a:  4-Node  Muldhop  RN 


Figure  4b;  3  Class  Polling  Model 


4),  when  establishing  a  circuit,  we  assume  that  a  call  needs  to  reserve  a  transceiver  at  the  source 
and  intermediate  nodes  only,  aind  not  at  the  destination  node,  which  does  not  have  to  transmit^^. 
Therefore,  in  Figure  4,  multihop  calls  simultaneously  reserve  the  resource  at  two  nodes  (e.g.,  for  a 
C4  call  we  reserve  a  transceiver  at  nodes  1  and  2),  whereas  all  other  calls  only  reserve  transceivers 
at  one  (source)  node. 

CaU  admission  is  concerned  with  the  decision  to  accept  or  reject  a  call.  We  consider  a  simple 
threshold  based  caU  admission  policy  where  a  ‘ci  voice  call’  is  accepted  if  and  only  if  there  are  less 
than  ti  calls  currently  active  over  circuit  i.  In  addition  we  require  the  pohcy  to  be  work  conserving, 
i.e.,  a  Ci  call  is  ailways  assigned  a  transceiver  if  the  threshold  is  not  exceeded.  Finally  the  thresholds 
ti  are  chosen  so  as  not  to  violate  the  physical  constraints  (limited  number  of  transceivers  per  node) 
of  the  system.  The  latter  can  be  expressed  as  linear  constraints  on  the  thresholds.  The  optimization 
problem  is  then  to  select  the  values  of  t,-  so  as  to  minimize  the  system  performance  subject  to  the 
system  constraints 


+  t4  =  n 
^2  "I"  ^4  +  ^5  = 

^3  +  ^5  = 

To  solve  for  the  optimal  thresholds,  we  convert  the  tandem  queueing  model  to  a  multiclass  polling 
model.  Let  the  transceivers  be  sequentially  indexed,  and  observe  that  cj  and  C5  calls  can  be 
assigned  the  same  transceiver  since  their  respective  paths  do  not  overlap.  Therefore,  by  analogy 
with  the  polling  model,  ci  and  cs  constitute  a  transmission  set.  Similarly  we  can  define  two  more 
transmission  sets  as  Tz  =  {ci,  02,03}  and  T3  =  {03,04}.  The  polling  model  is  shown  in  Figure  4b. 
Our  objective  is  to  select  the  optimal  frame  s*  =  [TiJ,n2,n5]  so  as  to  minimize  the  average  network 
blocking,  where  Uj  is  the  number  of  trainsceiver  assigned  to  treinsmission  set  i. 

Finally,  observe  that  the  threshold  associated  with  any  call  type  is  given  by  the  total  number 
of  transceivers  assigned  to  that  call  type.  That  is, 

ti  =  53 

fc  :  ien 

Therefore,  the  optimal  thresholds  are  given  by  the  following: 


^2  —  ^2 

^4  ~  ”3 

m 

tg  — 

tl  =  nj  +  n\ 

tg  =  0.2  +  Tlj 

We  have  performed  2  sets  of  single-rim  optimization  experiments.  In  each  case,  the  initial  slot 
assignment  vector  is  =  [0.4, 0.2,  0.4]  and  the  system  is  observed  for  50  parameter  updates.  We 
employ  a  gradient  based  optimization  algorithm  with  a  constant  step  size  rjk  =  0.25  and  increasing 

'*We  are  in  effect  assuming  that  receivers  are  readily  available,  and  that  only  transmitters  are  a  scarce  resource. 
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observation  interval.  In  particular  r*.  =  Tk-i  +  C  with  C  —  200  calls.  The  system  parameters  for 
the  experiments  axe  given  as: 

Parameter  Set  1:  24  Transceivers  per  Node 

•  Poisson  Arrivals:  Ai  =  A3  =  0.1;  Aj  =  A4  =  A5  =  0.2 

•  Call  Durations:  7)4  =  t]s  =  5]  171  =  f/?  =  »?3  =  UN  1(1,9) 

•  Initial  Observation  Interval  (tq)  2000  Call  Arrivals 

Parameter  Set  2:  24  Transceivers  per  Node 

•  Poisson  Arrivals:  A,-  =  0.2  Vj  =  l, .  .,5 

•  Call  Durations:  7^4  =  7^5  =  5;  —  UN  1(1,9) 

•  Initial  Observation  Interval  (tq):  2000  CaU  Arrivals 

Define  the  load  on  circuit  i  to  be  p,  =  Xi/fj,  where  rji  =  l/Vi-  In  experiments  1  and  2  we  assume 
24  transceivers  per  node  (equivalently  Tl  links  between  nodes)  and  consider  both  asynometric  sind 
symmetric  traffic  in  the  network  respectively.  Moreover,  for  both  parameter  sets  we  assiime  that  C4 
and  cj  calls  have  constant  duration  (5  frames),  whereas  aU  remaining  call  durations  cire  uniformly 
(recall  that  we  permit  only  integral  frame  lengths)  distributed  with  mean  5. 

Tables  2  -  6  give  the  system  blocking,  frame  design  and  the  corresponding  slot  assignments 
at  the  nth  iteration,  for  the  aforementioned  parameter  sets.  For  each  parameter  set  we  consider 
several  initial  frame  configurations  corresponding  to  different  r  vectors.  Prom  the  results  we  can 
make  several  observations;  first  for  blocking  networks  with  synunetric  traffic  (i.e.,  p,-  is  the  same 
for  each  circuit),  the  optimal  policy  is  to  never  accept  a  C4  or  C5  call  (see  Table  5  -  6)^^.  The 
corresponding  optimal  thresholds  are  therefore  =  <3  =  <3  =  t4  =  t^  =  0.  This  is  to  be 
expected  since  both  C4  and  C5  calls  reserve  a  transceiver  at  two  nodes  along  the  multihop  path, 
whereas  aU  other  calls  reserve  only  one  transceiver.  Therefore  by  accepting  a  C4  call  (and  thereby 
reserving  a  trcinsceiver  at  nodes  1  and  2)  we  are  automaticaUy  blocking  a  ci  and  C2  call,  whereas 
by  blocking  a  C4  call  we  have  the  possibihty  of  accepting  two  cabs.  Now  since  since  each  csdl  is  of 
the  same  priority,  and  the  arrival  rates  are  the  same,  the  optimal  pohcy  is  never  to  accept  C4  cabs. 
A  similar  argument  hold  for  C5  cabs.  However  once  the  load  is  different  on  different  circuits,  the 
optimal  pobcy  is  no  longer  to  always  reject  multihop  cabs  (see  Table  1-3).  Secondly,  9  is  optimized 
conditioned  on  an  initial  vector  r.  Thus,  we  observe  that  although  in  Tables  1  eind  3  the  initial 
frame  is  common.  O'"  is  different  for  each  case.  However  as  is  conjectured,  the  optimal  frame  is  the 
same  for  each  pair  (9l,rk). 

'^We  conjecture  that  this  observation  holds  true  irrespective  of  the  actual  value  of  the  load.  Therefore,  even  when 
pi  is  sufficiently  small  such  that  the  uncontrolled  (i.e.,  subject  to  no  admission  control)  network  can  sustain  aD  offered 
traffic,  the  optimal  oplicy  is  to  never  accept  a  c<  or  c^  call. 
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Conclusions  and  Future  Work 


We  have  considered  the  transmission  scheduling  problem  in  an  N-  node  Radio  Network  with  ho¬ 
mogeneous  traffic  (exclusively  voice  or  data).  The  scheduling  problem  is  modelled  as  a  multiclass 
polling  problem  where  a  single  resource  must  provide  service  to  M  predefined  transmission  sets. 
Customers  arrive  in  one  of  several  arrival  streams,  and  a  customer  from  a  particular  arrival  stream 
is  allowed  to  join  one  of  several  prespecified  transmission  sets  associated  with  that  stream.  The 
traffic  Grade-of- Service  requirements  lead  to  different  scheduling  polices,  in  particular,  we  consider 
a  random  poUing  policy  for  data  traffic  and  a  cyclic  polling  policy  for  voice  traffic.  We  develop 
derivative  estimation  algorithms  and  use  them  in  conjunction  with  a  gradient-based  optimization 
scheme  to  adaptively  alter  the  schedule  (in  a  stochastic  operating  environment)  to  achieve  the 
desired  system  performance.  An  important  feature  of  the  proposed  methodology  is  that  it  is  in¬ 
dependent  of  the  nature  of  the  node  arrival  processes  (and  arrival  rate  information,  except  for 
checking  that  no  stability  requirements  are  violated).  Finally,  for  the  case  of  data  trciffic,  we  use 
the  optimal  slot  assigmnent  probabilities  to  construct  a  cyclic  or  deterministic  scheduhng  pohcy, 
using  the  Golden  Ratio  policy  [15]. 

In  the  context  of  voice  call  scheduling,  the  proposed  optimization  scheme  can  be  viewed  as  a 
new  approach  to  solve  a  class  of  resource  allocation  problems  with  combinatoriaUy  hard  stochastic 
features.  Also,  the  imderlying  ideas  for  derivative  estimation  using  marking  and  phantomizing  tech¬ 
niques  apply  to  general  performance  measures  -  not  just  the  waiting  times  or  blocking  probabilities. 
Therefore,  estimates  of  more  general  performance  measures  (e.g.,  probabihty  that  a  customer  delay 
exceeds  some  deadline,  or  probability  that  the  queue  length  at  a  node  exceeds  some  value)  can 
also  be  obtained.  This  raises  the  interesting  possibility  of  investigating  scheduling  problems  invol¬ 
ving  traffic  classes  with  different  performance  reqtdrements,  an  issue  of  considerable  importance  in 
networks  designed  to  support  voice,  video  and  data  (see  [24]). 

Another  natmal  next  step  for  om  work  is  to  address  the  problem  of  routing  in  multihop 
radio  networks.  This  c2Ln  be  done  by  defining  appropriate  transmission  sets  and  scheduling  packet 
transmissions  accordingly.  Although  this  problem  becomes  considerably  more  difficult  when  the 
network  combines  both  voice  and  data,  we  believe  the  same  basic  approach  described  in  this  report 
can  be  applied. 

Finally,  it  is  obviously  of  great  interest  to  investigate  distributed  implementations  of  the 
various  algorithms  we  have  presented  for  transmission  scheduling.  We  believe  this  is  indeed  possible, 
but  have  left  this  research  direction  for  future  work. 
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Experiment  1:  Tl  Link  with  Asymmetric  Traffic: 
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Table  2:  Optimization  of  4-node  Tandem  RN 
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Table  3:  Optimization  of  4-node  Tandem  RN 
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Table  4:  Optimization  of  4-node  Tandem  RN 
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Experiment  2:  Tl  Link  with  Symmetric  Traffic: 
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Table  5:  Optimization  of  4-node  Tandem  RN 
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Table  6:  Optimization  of  4-node  Tandem  RN 
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